Linear - to - Mel Freq . Filter Bank Conversion Fourier Transform
نویسنده
چکیده
منابع مشابه
Cepstrum derived from differentiated power spectrum for robust speech recognition
In this paper, cepstral features derived from the differential power spectrum (DPS) are proposed for improving the robustness of a speech recognizer in presence of background noise. These robust features are computed from the speech signal of a given frame through the following four steps. First, the short-time power spectrum of speech signal is computed from the speech signal through the fast ...
متن کاملFrequency warping and robust speaker verification: a comparison of alternative mel-scale representations
Accuracy of speaker verification is high under controlled conditions but falls off rapidly in the presence of interfering sounds. This is because spectral features, such as Mel-frequency cepstral coefficients (MFCCs), are sensitive to additive noise. MFCCs are a particular realization of warped-frequency representation with low-frequency focus. But there are several alternative, potentially mor...
متن کاملRobust MFCCs Derived from Differentiated Power Spectrum
The mel-scaled frequency cepstral coefficients (MFCCs) derived from Fourier transform and filter bank analysis are perhaps the most widely used front-ends in state-of-the-art speech recognition systems. One of the major issues with the MFCCs is that they are very sensitive to additive noise. To improve the robustness of speech front-ends with respect to noise, we introduce, in this paper, a new...
متن کاملImplementation of a Tree-Structured Filter Bank Using Cyclic Convolutions
A filter bank in which cyclic convolutions are used in place of linear convolutions will be referred to as a cyclic convolution filter bank (CCFB). A dyadic tree-structured CCFB can be used to perform a discrete wavelet transform suitable for coding based on symmetric extension methods. This paper derives an implementation technique for the tree-structured CCFB that can be realized by FFT proce...
متن کاملA Wavelet Packet and Mel-Frequency Cepstral Coefficients-Based Feature Extraction Method for Speaker Identification
One of the most widely used approaches for feature extraction in speaker recognition is the filter bank-based Mel Frequency Cepstral Coefficients (MFCC) approach. The main goal of feature extraction in this context is to extract features from raw speech that captures the unique characteristics of a particular individual. During the feature extraction process, the discrete Fourier transform (DFT...
متن کامل